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ABSTR*\CT 

Protocols were obtained from 22 subjects as they 
discovered the c^^'Jitions under which equilibriiim is obtained on a 
balance beam by predicting and observing the outcomes of a series oi 
problems. The interviews revealed that subjects useu a variety of 
heuristics to make predictions once they had isolated the two 
relevant features of the problem, weight and distance, but before 
they could appropriately relate them in one procedure. These 
heuristics included instance-based reasoning, qualitative estimation 
of distance, and the use of quantitative rules of limited generality. 
Variability in the use of these heuristics by individuals suggests 
that learning to understand the balance beam is not best described in 
terms of a simple rule acquisition theory, and that reasoning from 
instances plays a more important role in inferring a physical 
relationship than previously thought. (Author) 
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Abstract 

Protocols were obtained from 22 subjects as they discovered the conditions under 
which equilibrium's obtaired on a balance b^am by predicting and observing the 
outcomes of a series of problems. The interviews revealed that subjects used a 
variety of heuristics to make predictions once they had isblat>^ the two relevant 
features of the problem, weight and distance, but before they could appropriately 

9 

relate them in one procedure. These heuristics included instance-based 
reasoning, qualitative e^gtimation of distance, and the use of quantitative rules 
of limited generality. Variability in the use of these heuristics by individuals 
sugl^j^ts that learning to Understand the balance beam is not best described in 
terms of a simple ru7 " acquisition theory, and that reasoning from instances 
plays a more important role in inferring a physical relationship than previously 
thought. 



In thi& paper, we are concerned with the way in which people develop an 
understandxng of pnysxcal concepts that have the following properties: (1) two 
separable variables are involved^ (2) those variables must be measured or 
Quantified in some way, and (3) the measurements may be combined according to 
some rule, yielding a. third quantity or construct that allows prediction of what 
will happen for any combination of the two variables. Examples of such concepts 
include density, the size of shadows, and the ballance beam. 

The balance beam is the particular pnysxcal concex^t that we have cnosen to 
study# fl balance beam consists of a bar placed on. top of a fulcrum or balance 
poxnt. Weitjhts may be placed on both sides of the fulcrum. The effectiveness of 
a weight in causing the beam to tip down is determined by the product of the 
weight (w) and its Distance from the fulcrum (d), a construct called the the 
torque associated with the weight. If the total torque (i.e., Iwidi) associated 
with the weights on eaph &ide of the beam is tne same, the beam will balance. If 
not, the beam will tip to the side with the greater torque. Because of its form, 
this rule will subsequently be referred to as thte prod act -moment . r u 1 e, 

Thera are a numher of reasons why. balancing as an important and rich domain 
in which to study how subjects learn to combine information about variables. 
Most people have some understanding of the factors that determine whether 
balancing will occur, tven fairly young children can often identify weight ana 
distance from fulcrum as the critical variables anrj provide reasonable intuitive 
explanations of why a set of weights will or will not balance on the balance 
beam. On the other hand, relatively few adults can specify a rule that will 
allow them to predict what will happen m any given situation. In fact, over 
several experimental replications, only about of adults have produced 
responses to balance beam problems consistent witn the product-moment rule 

« 

(Jackson, ISbS; Lovell, Sieqler, 1^76). Even when provided with specific 

experiences intended to promote uncerstandinq of the concept of balancing, adultis. 



do not easily derive the prod act -wowent rule. It is the period of transit ion . , 
between being able to identify the relevant variables and being ablb to combine 
them f*o as to make correct predictions that is of major interest to us. 

Why is it so difficult to generate the product -moment rule? In fact, the 
rule is easy to use: Siegler <1976) taught three 10-year-old children the 
product-moment rule and they were subsequently able to use it successfully. It 
is clear, however, that the rule will not be derived if (1) the relevant features 
or dimensions of the problem are not identified, (2) only one of the features is 
considered or dominates the other, or (3) both features are considered, but not 
within a coordinated frarnework. There is considerable evidence suggessting that 
these reasons, in the order presented, describe stages of understanding many 
physical laws. Our previously stated interest in the transition phase implies 
that the third reason is our primary concern. In order to understand fully how a 
coordinated framework develops, however, it is necessary to consider earlier 
phases of understanding as well. 

There have been two major descriptions of the phases of understanding the 
balance beam: Inhelder and Piaget's (1956) stage theov^y, consisting of three ^ 
stages, each divided into two substages, and Siegler' s (1976; Klanr and Siegler, 
1978) hierarchical rule models. Both descriptive systems are presented in Table 
1. 



Insert Table 1 about here. 



Inhelder and Piaget (1958) presented two types of tasks. In the first task, 
they employed a balance beam that had holes at equal intervals on both sides of 
the fulcrum (£8 holes on each side) and weightp; of differing sizes that could be 
hunq at various distances from the fulcrum. In the second task, children were 
presented with a balance with no holes in the crossbar and instead of weights 



there was a baski^t on each side into which dolls could be placed. Mo units were 

/ ■ ■ . • 

marked along the crossbar. Subjects, who ranged in age from three to about 14 

years, were told to play with the balance beam to find out how it worked. 

ftlthc^gh in the first task' it was possible to hang weights at more than one 

» 

location on each side of the fulcrum, apparently even the most advancecj subjects 
did not do so. Therefore, the rule wj/wg = d?/di (henceforth referred to as the 

»* I 

^Catio rule), where wi and W2 are the amounts of weiyht on tach side of the 
fulcrum and dj and d£ the corresponding distances from the fulcrum, would always 
be sufficient to predict when balancing would occur, -and the general form of the 
product -moment rule was not needed. 

Examination of Inheider ana Piaget's stages in Table 1 suggest an 
increasingly systematic approach to understanding the influence of the variables 
of weight ana distanpe. . However, the usefulness of the aescription is limited by 
the vagueness of some of the terms employed, such as "qualitative understand ino. " 
In addition, it is not clear what is necessary to advance from one stage to the 
next. Finally, as we have indicated above, Inheider and Piaget discussed the 
ratio rule but not the most general form of the product -moment rule. 

c 

. Siegler <1976) used a balance beam that had fou*^ pegs on each side at 
equally spaced distances from the fulcrum and equally sized metal weights that 
had holes in their middles so that th^y could fit over the pegs. Subjects ranged 
from kindergartners to twelfth graders and had their understanding of the balance 
beam assessed after having participated in one of th- ee conditions. In the a 
priori condition, subjects were not given any tKperience with the beam before 
being tested for understanding. In the experimentation condition, subjects were 
told that there were rules by which they could could predict the action of the 
balance beam and that they should "experiment" with the beam and the weights and 
try to learn hot-v the beam worked. In the observation condition, subjects were 
also told that a rule existed and were presented with a predetermined series of 
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3& problems whose outcomes they could observe. . • <. * 

Under Bt and inq of the balance beam was assessed by presenting each subject 
with a series of 30 problems that would potentially distinguish different levels 
of knowledge, For each configuration of 'weights, subjects were, asked' what w<?uld 
happen if the balance beam (which was kept level by two wooden blocks that were 
placed beneath the arms of the baiance) was released. No feedback concerning the 
action of the balance beam was, provided. • y , «vy • ^ 

3iegler proposed that, performance on the balance beam can be described by 
the set of four hierarchical models presented in Table 1, and further^^ that an t 
individual progresses developmental ly from model to. model in an invariant order. 
Siegler classified the responses of 1(37 of the 120 children as conforming to the 
predictions made by one of the four models ana concluded that the study provided 
considerable support for the descriptive accuracy of the models. The responses 
of most (£3 of 313) b- and 6-year-olds were predicted by Model I, while the 
responses of most , (48 of 90) of the older children were predictf2d by Model III. 
Notably, the responses of only 5 of the 30 children in the oldest group (16- and 
17-year-old5) were predicted by Model IV. There was little effect of the varied 
prior exoerience on performance. 

Siegler' s (1976", 1978; Klahr and Siegler, 1978) rule-based set of 
developmental models has several advantages over Inhelder and Piaget's (1S58) 
st^ge theory. The most notable is the greatly simplified interpretation of 
complex behavior. The models make testable claims concerning which balance beam 
problems will be answered correctly by a subject performing at the level of one 
of the four i.'odels. 4n addition, specification of the models in terms of 
production systems (Klahr ano Siegler, 1978) make it possible to state how a 
model M.tGt be changed, by adding or modifying specific productions, in order to 
advance to the model of next increasing complexity. The clarity ami testability 
of these models allows one to pose quest icr.s relevant to instuction. For 
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.eKawple> one can investigate what kinds of instruction are rffective in producing 

movement to more cornpleK models and wnether people of different ages are 

differentially responsive to instruction. 

The simplicity of Siegler's hierarchical models (i.-aku nem an at'tractive 

system for predicting the behavior of peopl^j interacting with a balance- beam. 

However, several potential difficulties arise wnen they ere vi^wtd as models of 

the reasoning process, rather than merely ad predictors of behavior. One concern 

15 that, as Strauss ano Levin (1981) have pointed out, "...the rules of the rule 

system are the outcome of an interaction between task variables and an 

overarching cognitive system that attempts to deal with them "(1981, p. 76), so 

that ..the relative simplicity of Siegler's rules as compared to Inhelder and 

Piaget's descriptive system rnay in part reflect the more structured nature of the 

tasks that Sisgler employed. In addition, we will argue that there are reasons 

to beilieve that systematic predictions are based on considerations other than 

those suggested by the decision trees that make up Siegler's models and for that 

reason these decision trees are not adequate models of the reasoning process. 

There are three general areas of concern: (1) what is involved in the change from 

Model III to model IV, (£) how distance is encoded (i.e., whether distance is 

dealt with as an ordinal variable or encoded numerically) at various levels in 

the hier-irchy, and (3) the limitations in the knowledge the models allow to be 

/ 

used in making a decision about a balance problem. 

JjL?-J^Ci^IL5X^iJm^n.pm T)e hierarchical rule models are 

relatively weak in specifying how transitions between models occur, particularly 
from Model HI to Model IV. The production system formulation of the models 
presented by Klahr and Siegler (1978) allows one tq state what alterations must 
be made to Model III to yield Model IV. However, the prot^esses by which the 
changes occur are not obvious. 

Model IV is derived by altering Model III such that when a conflict problem 
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is encp\untered| predictions are no longer based on '^muddling through" (i.e^, 
randomly guessing) • Instead, they are based on computing and comparing the 
torques for each arm of the balance beam (i.e., predictions are based on the"* 
product -moment rule)- ft strict interpretation of Siegler's rule hit?rarchy as a 
developmental model; of reasoning would suggest th^tt t*ie tran^fC^rmat ion from a 
system dealing with conflict problems in a random way .(Model III) to* one dealing 
with thern correctly (Model IV) occurs in a single step. There seerns reason to 
believe that at least some subjects pass through^ ari intermediate stage in which 
they generate rules that incorporate both weight and distance but are of limited 
generality. In Inhelder and Piaget's classification, the quantitative rule 
arrived at in Stage HI is the ratio rule, not the more general product -moment 
rule. Moreover, Klahr and Siegler's <1976) detailed analysis of a single subject 
and description of Model III explanation^ advanced by subjects seem to indicate 
that the use of such rules is probably quite common. 

Siepler (1976) classified subjects' responses as conforming to Model III if 
there were (1) fewer than £6 correct responses on the 3iZ» posttest problems, (2) 
at least 13 correct reponses on the 12 nonconflict problems (in whi::h either 
weight or distance or both were tne same on both sides of the fulcrum or else the 
greater weight was on the same side as the greater tistance), and (3) more than 
four departures from complete reliance on the weight dimension on the 13 conflict 
problems. However, these criteria could have been met by subjects employing a 
variety of rules and strategies, such as using the ratio rule or special cases of 
it (e.g., 2:1), or even comparing the results of addin g weight and distance on 
each side of the fulcrum rather than multiplying them (which would correctly 
predict that configurations like 010i3/3J0i20 would -balance but would not work in 
general). It seems clear that Klahr and Sieqler themselves considered Model III 
to be an umbrella classification for a host of roore specific strategies that are 
i neons ii=.tently adopted. However, the existence of such stategies was mentioned 
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only parerithetically and' not incorporated into the description of Model III. 

f\r\ important question, .therefore, is whether subjects generate rules such 'as 
the 'ratio /ule,' that incorporate botlj. weight and distance but do not apply in all 
situ^itio/ts, before they \1 earn the mor^ general product -moment rule. 

Encoding of distance . \ ft second area of concern is the manner in which 
distance is encoded. The use of dVstance f.it^t occurs in Model II ip vjhich 
distance is considered^ if - tiie ^eights are the same on both sides of the fulcrum. .• 
However, the only judgments ^>bout distance necessary for Model II or Model III 
are ordinal: 'tha.t is^^^jhether distance on one side of the fulcrum is less than, ^ , 
•equal to, or greater than tne'.distance on the other. Only for Model ,IV must both* ^ 
weight anrf distance be encoded in a quant itat ive or numerical fashion so that 
torque may be" calculated. It is not clear wnether subjects performing at t^ie 
level of Model II or 111 en?.codB distance numerically, as .Steg^ler' s (19^). N ' 
research would seem to imply, make crude perceptual juagments, or .simply make, 

ordinal decisions about distance. If subjects do not encode distance.^' 

* . ■ ■ ■ \' 

numerically^ they will have difficulty generating tne product-moment rule and' 

this may result in the use of alti?rnativB strategies. • 

The manner in which subjects encode distance i's therefore of critical 
importance , in building a model of how they reason" be/ore a>;riving at the ' ^ "A 



product -moment rule. Tne aoparaCu? used ^ by Siegler (197fc) does not particularly 
lend itself to the investigation of this issue, si^ce th^four pegs on e^ch side 
of the fulcrum at which, weights could be placed are very sajient.. In the pre^sen^ 
study, we attempted to qain insight into how subjects ertcoded distance in a 
Eituation in wnch a measurement scale was provided but was notuso obvious. 

yge^ of inf orm at i on_ab_QjLt__previ.caj.5_jjrQ.^^ ' *^n additional concern is that 
^"^"^ccordir.o to Siegler' s rule-based models, when dealing with a balance proolem the 
subject will go through a decision tree but will not ute information about either 
specific problems or general types of problems encountered previously. It seems 
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likely, however, that at least 'Bome of the tjrne, subjects making'' predictions 
about a series of balance proDlems will compare a given 'configuration of weights 
with those observed earlier. In fact, some of the experience with the balance 
beam that Siegler provided his subjects would seern to encourage sucn comparisons. 
In his observati^ condition, subjects were presented with four sets of five 
problems each in which problem n differed from problem n±i by only tt e addition 
or removal of a single weight. In such a series of problems, there are severail 
points at which the correct outcome could be predicted' on a logical basis merely 
by considering the change from the previous problem. For exa/nple, if a single 
.weight is added 'to one arm of a configuration that previously balanced, the beam 

must tip to that side. 
♦ 

'On^ basis for building a rule-based set of models is the assumption that 
because the final state of learning can be described in terms of a r^^e, the 
intermediate stages are also best describe(J as rules. There seems reason to 
question this assumption. Brooks (1978) showed that when a is available for 

categorizing a set of Exemplars, but is sufficiently complex and difficult to 
induc^suDject|S categorize new instances .-^f a category m terms of their 
similarity to previously observed instances. They, do not necessarily generate an 
approximation to the ^rule to use as a-basis for categori^e^it ion. Olthough the 

product-moment law is not as complex a rule for categorizatibn as the examples 

■ ■ ' J 

used by Broods, it is obviously difficult to induce, suggesting that at 
intermediate stages of learning subjects may make predictions by comparing the 

current problem to representations of previous'problems.. Pt the very least, 

^ . . . . 

Erooks' findings suggest the possibility that reasoning before the product-moment 

rule has been learned is not best characterized as the use of simple rules. 

One rnight consider all balance , Beam problems to bp members of one of three 

categories: (\) "balance" configurations; (2) "tip right "„ configurat ions, or (3) 

"tiD left" configurations. .Correct categorization 6f all balance beam problems 

\ 
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into one of these three caiegories would correspond to Model IV behavior. Thus, 
the decision aDout a balance problem can be conceived of as a problem of 
categorization, and the relevance of thRories of boncept formation can be 
considered. 

There is consensus that pnenomena associated with the categorization of 
natural objecls can be accounted for by some combination of two types of models, 
exemplar models and prototype . models. Exemplar models propose that new exemplars 
are categorized on the basis of comparisons with stored rt^presentat ions of 
previously experienced exemplars. In addition, exemplars differ from each other 
.in how well they represent the category and because of this, thd boundaries among 
natural object catpgories are considered to be "fuzzy (M^rvis and Rosch, 1981). 
Good exemplars of a category share many features with other exemplars and few 
features with members of neighboring categories. Poor exemplars share fewer 
features with other examplars and more features with members of neighboring 
categories (Mervis and Rosch, 1981; homa, 1984). 

Prototype models propose that new exemplars are categorized on the ba^is of 
comparisons with abstract summary oescript ions of categories, fl prototype can be 
thought of as the weighted average of the features of exemplars of the category, 
thus reflecting the occurrence of relatively common and uncommon features (Smith 
and Med in, 1981). Whether protoype or exemplar information is used appears to 
depend somewhat on the experimental concext (Medin, Alton, and Murphy, 198A). 
Smith and Medin (1981) suggest that both types of information might be used in 
some situations and Malt (personal communication) has found an experimental 
condition in which this appears to be the case. 

The categories of balance, tip right, and tip left are somewhat different^ 
from those typically studied in experiments dealinr with natural categories. 
However, exnmp. ^rs of these categories are similar to those that have been 
studied in that they are generally viewed one at a time and differ in how 
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representative they^^^^a^^-f the category to which they belong, (i.e., the beam 
can be slight'ly balance). It is possible that subjects use similar 
strategies for categorizing exemplars'?' whether these exemplars are natural 
objects or configurations of weights. Even when told that a rule exists that 
will allow correct predictions for all balance problems, subjects may, at least 
before tne rule has been generited, attempt to deal with balance problems' by 
comparing them with some representation of previously observed problems. 

In summary, although the rule-based models proposed by Siegler (1976) and 
Klahr and Siegler (1978) have appeal as a system within which to describe the 
predictions people make wnen given balance problems, there seem to be reasons to 
believe that they are not adequate models of the reasoning process. In the 
present study, r.n attempt was made to determine whether, when provided with a 
series of balance problems and given the *-ask of inducing the pro duct -moment 
rule, p ople (1) generate rules of lim. generality that involve both weight 
and distance before they learn the genti-^i^'product -moment rule, (2) encode 
distance numerically at different levels of performance, and (3) base judgments 
about .balance problems on specific information about previously encountered 
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problems or classes of problems. ^ *t 

The study consisted of two phases, ^n the pretest {«>ase, a paper-and 
test was used to identify subjects who had not yetl learned the product -moment 
rule. In the training phase, £2 of these subjects were presented with a series 
of balance problems using wopden blocks and a balance beam. Although the set of 
talance problems was modeled after that used in Siegler' s (1976) observation 
condition, the procedure differed in several important ways. In Siegler' s study, 
the observation condition consisted of 36 trials< on each of which the 
experimenter placed weights on the balance beam, removed the wooden blocks 
holding the beam level, and allowed the subjects 10 seconds to observe the 
outcome. In the present study, (1) subjects were told to predict the outcome for 
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each balance problem before beinq allowed to observe the outcome, (2) subjects 
were esked to think aloud while' performing the task and, if possible, to provide 
justification for their prediction, and <3) the training phase continued until 
subjects gave answers consistent with the product -moment rule. 



Method 



• Pretes t Pha&e 

SubjLBCt^. Forty-eight students <3(3 women and 18 men) enrolled in psychology 
classes at the University of Massachusetts received bonus credit for 
participation in the pretest phase of the study, ftge ranged from 17 years to 36 
years with a jnean of £(3.5 years. 

Problem^. The prestest consisted of 12 schematic line drawings of a balance 
beam presented with various configurations of weights. These problems varied in 
both type ana difficulty. There were tnree "simple" problems that did not 
require the product -moment rule for correct predictions because either (I) weight 
or distance or both were equal on both sides of the fulcrum (e.g., fH^m/HZm ~ 
Note: notation indicates the number of weights 1,2,3, and 4 units of distance on 
either side of the fulcrum — in this case, there are tnree weiyhts located three 
units of distance to the left of the fulcrum and three weights located two units 
of distance to the right of the fulcrum) or (£) the greater weight was associated 
with the greater distance (e.g., 012i3/ll«iZi) . The remaining nine problems 
represented "conflict" situations in which the greater weight was associated with 
the lesser distance. For three of these problems, the beam would have tipped to 
the Bide with greater weight (e.g., «iic;Sei/W2) and for three it would have tipped 
to the Side with the greater distance (e.g., limfaB^Q), The remainina three 
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problems presented balance situations (e.g., im/izm). The problems employed 
depicted situations with weights placed at no more than two locations on either 

side of the fulcrum. 

Proce dure . The pretest was administered to groups of three or four 
subjects. Subjects were instructed to predict whether the balance beam presented 
in each problem would tip to the left, balance, or tip to the right. No time 
limit was imposed and feedback was not provided. 

Analysis. All subjects who made three or more incorrect predictions were 
classified as nonbalancers. Thirty-six of the AS subjects who took the prestest 
were classified as nonbalancers. 
Trainin g Pt!ja&? 

Subjects. Twenty-two of the 36 subjects classified as nonbalancers on the 
prestest (16 men and 6 women) were randomly chosen to participate in the training 
phase of the study. These subjects had a mean score of 6.40 correct answers on 
the pretest with a standard deviation of 1.53. 

Ma terial s. The balance beam used in the the training phase consisted of a 
flat, rigid aluminum bar balanced on a fixed fulcrum at its midpoint. Distance 
from the fulcrum was denoted by marks at regular intervals drawn on a lightweight 
acetate scale placed on top of the aluminum bar. The unit marks were . . 

approximately 5 cm. apart, and half units were also indicated. It should be 
emphasized that in contrast to Siegler's .(1976) study in which weights could bs 
placed only on four pegs on each side of the fulcrum, in the present study, 
weights could be moved continuously along the surface of the beam. The weights 
were wooden cubes whose sides were approximately 3.5 cm. 

Problems. The set of problems used was modeled after that used in Siegler's 
(1S76) observation condition. Problems presented near the beginning of the 
session were relatively simple and became progressively more complex. The first 
few problems consisted of single blocks or stacks of blocks placed on each 'ide 
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of the fulcrum such that either the weight or distance (or both) was equal on 

both sides of the fulcrum. These pi . blerns were followed by three sequences of 

problems each of which beqan with two equal stacks of blocks, one placed closer 

to the fulcrum than the other. Blocks were added. one at a time to the stack 

closer to the fulcrum until the beam balanced and then tipped in the opposite 

direction. Thus subjects saw how problems with conflicting weight and distance 

could tip to either side or balance. After completing these three sequences of 

proolems, the balance situations from each sequence were repeated. The final and 
♦ 

most complex problems employed two stacks of blocks on one or both sides of the 
fulcrum. There were up to six blocks per stack. 

P rocedure. Before the session began, the subject was informed of and 
consented to the videotaping of the interview. Each subject was interviewed 
individually, while seated opposite the interviewer with the balance beam between 
them. 

The subject was told tnat he or she would be given a series of balance 
problems. The instructions stated that the task for each individual, problem was 
to predict whether the beam would tip to the left, balance, or tip to the .right; 
and further, that there was a general rule which would allow correct prediction 
for all the problems and that the subject should attempt to determine this rule, 
Subjects were asked to think aloud while performing the task and if possible to 
provide justifications for the responses they made. 

For each problem, the interviewer placed blocks on the balance beam while 
holding it level, ftfter the subject made a prediction and commented on it^ the 
beam was released so that the subject could determine whether the prediction was 
correct. The interviewer then continued with the next problem. 

In the first part of the training phase, all subjects received the same 
problems in the same order. There was, however, some variation in the later 
problems, as subjects were allowed to request that c -^articular configuration of 



weights be set up at any tirne, although they were not specifically instructed to 
do so. In addition, if the subject stated an incorrect hypothesis about the 
balance beam, the interviewer attempted to set up special problems that 
contradicted the hypothesis. The criterion for concluding that the subject had 
induced the product -moment rule was five consecutive correct predictions on 
complex problems (i.e., conflict problems with more than one stack of weights on 
each side of the fulcrum). 

If, after 40 problems, the subject had not made any hypotheses about general 
rules and did not appear to be counting the numbers of distance units, the 
.interviewer indirectly prompted the subject to pay closer attention to distance. 
She did so by asking "How far out is that pile of blocks?" when the subject made 
a vague comment that a particular stack of blocks was "closer in" or "further 
out." 

f^na lvsis . Three types of information were of major interest: (1) evidence 
that subjects based their reasoning on rules of limited generality that involved 
both weight , and distance before they learned the product -moment rule, <£) 
evidence that subjects either did or did not encode distance quantitatively at 
different levels of performance, and <3) evidence that suhjects based predictions 
on information about previously encountered problems or classes of problems. 

SubjiL-os were consioered to have used ratio reasoning if they stated either 
before or after making a prediction the relationship between the ratios of weight 
and distance in the problem. Verbal references to counting and pointing to the 
distance marks were considered to be positive evidence of numeric encoding of 
distance. In addition, evidence for the lack of numeric encoding of distance was 
•noted. Indications of gross perceptual judgments such leaning back to judge 
distance, commer.ts such as "It's hard to tell whether they're the same," and 
questions such as "Are the lines on the beam important?" were cc .sidered to be 
evidt-nce that people were not encoding distance numerically. These two type^ of 
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information were coded by one person. 

The coding of reasoning based on prior instances was somewhat more 
complicated, given the less explicit nature of the responses. Coding was done by 
two independent coders with disagreements resolved by a third, so that a datum 
had to be agreed upon oy at least two of 'the three coders before being counted. 
Reasoning from particular proolems and from particular classes of problems (i.e., 
those with a particular relationship between the ratios of weight and distance) 
were included in this classification. References to particular instances were 
broken down into problems in which a single change had been made to the previous 
proDlem and those in which two changes had been made to the previous problem. 
The reasoning was coded as logically correct or incorrect based on the 
relationship Dstween the previous anc the current problem and the subject's 
response. For example, if the subject noted that the beam was previously 
balanced and one block was added so that it must now tip to the side of the added 
block, he or she was logically correct. However, if the beam had been tipped to 
one side and one block was now added to the other side, the subject would be 
incorrect in reasoning only on the basis of that fact that the beam should now 
balance. 

References to classes of conf igurat-'.ons previously encountered were also 
coded into logically correct and incorrect forms. Correct forms included (1) two 
sets of ratios that balance will balance if "jdded together" (e.g., mZ^/fd^Zl 
must balance because it is composed of 000^/82630 and 0020/0001, both of which 
balance), and (£) a configuration of weights will not balance if it differs in a 
critical way from a ratio configuration that does balance (e.g., 0100/3100 will 
not balance because it differs by one block from -0100/3000 which does balance). 
Incorrect predictions included those based on the similarity between the current 
conficuration ana one previously encountered (e.g., 1100/£c:00 is predicted to 
balance because it is similar to 2000/0400). 
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All £2 subjects were aDle to meet the criterion for learning the 
product -moment rule. The mean number of trials required to meet the criterion 
was 49.1^1 With a standard oeviation of IS. 4 (ranoe £5 to 68 trials). Analysis of 
the data indicated that subjects engaged in a variety of behaviors as they 
fcttteiflDted to determine whether the balance beam would balance. Not all common 
behaviors were observed in all subjects, suggesting that there may be no- 
invariant seouence of behaviors tr»at subjects must engage in while inducing the 
product-moment rule. There were three major findings observed in the interview 
data; (1) most of tne subjects seemed to develop and use the ratio rule before 
using the product -moment rule; (2) many subjects gave evidence of not encoding 
ciatance numerically during the first part of the session; and <3) most subjects 
seemed to employ specific information about previously experienced configurat ions 
in making decisions about balance problems. 

Ih e use of ru le s o f Umite d gene ral itv 

At least 15 of the £2 subjects employed a quantitative rule that involved 
both weight and distance but was of limited generality before generating the 
product -moment rule. One subject generated ana teste the hypothesis that 
balance would occur if the sum of weight and distance on each side of the fulcrum 
<B. g., a stack of three blocks one unit of distance away from the fulcrum would 
yield a sum of four) was equal. Fourteen subjects explicitly verbalized a form 
of the ratio rule at some point during the interview (e.g., made a statement like 
"it should balance because the stack on the left has twice as many blocks but the 
f.tack on the right \s twice as far from the center). On the average, for these 
14 subjects, the rule was first verbalized on trial £1.1 <5d= U.£S wall before 
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tne criterion for the product -moment rule was reached. There was little 
difference in the number of trials to criterion ror the lA subjects who 
verbalized the ratio rule (fnean=5i3. 1, sd«14.0) and the eight who did not 
(mean=A7. 1, sd=17.6>). 

In general, subjects ^^ho stated sotne form of the ratio rule did not do so on 
the first simole balance problem (i.e., a problem in which there were single, 
different-sized stacks of blocks on each side of the fulcrum and the beam 
balanced) tney encountered. The first simple balance problems were presented on 
■ triAL-7...<Ji?3/0eilia) and trial 11 <id!Zi04/i2i£Oei) . Only one subject stated tne ratio 
•rule on trial 7 and only three others did so on trial 11. 

There seemed to be a tendency for subjects to state the ratio rule first on 
a trial for which the ratio was 2:1, suggesting that the rule was first generated 
m simple situations and then generalized. Ten of the 14 subjects who stated the 
ratio rule did so first on a trial for which the configuration was mUh/fdZm or 
(c^ma/idim, desoite tne fact that all 18 of them had encountered one 3:1 ratio 
balance problem earlier (trial 7) and that six of them had previously encountered 
eit least two simple balance problems in which the ratio was not £!l. 

Stating a form of the ratio rule correctly did not invariably result in 
correct predictions for all subsequently presented problems that could have been 
easily handled by u&e of the ratio rule. Apparently, subjects either did not 
matialiy learn the ceneral form of the r.ule, did not employ the ratio rule as 
their exclusive heuristic for two-stack conflict problems, or did not always 
encode Distance accurately enough to predict correctly. On the average, the 14 
snbiects missed 1.6 of the 6.6 simple balance problems they received between 
first verbalizing a form of the ratio rule and reaching criterion. In addition, 
thf?y rncejvt^d an averaoe of 2.4 simple imbalance problems (two-stack problems in 
v.lxch tnere was more weight on one side of the fulcrum and greater distance on 
the other but for which the ratios of weight and distance were not equal) and 
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made an average of .S errors on them. 

aome error dava supgest tha^. subjects way have first learned the rule for 
one ratio and only later generalized it to others. Seven of the 10 subjects who 
first stated tne ratio rule o^ a simple balance problem with ratio 2:1 
subsequently maoe errors on other simple balance problems, but only one of thern 
did so on a problem with ratio Collectively, these subjects made 16 errors 

on simple balanca problems after first stating the ratio rule, including five 
errors on problems with a ratio, five on problems with a 3:£ ratio, and four 
on problems w'th a 4:1 ratio. The four subjects who first stated the ratio rule, 
on a 3:1 or ^:1 ratio trial later committed six errors on simple balance trials, 
only ens of which was on the same kind of trial. 

Other data suggest that suDjects fj^ay have wade .some errors on ratio trials 

by not jucging distance carefully enough, fts will be discussed later, subjects 
t>^eQuently aid not use the markings on the scale to help encode distance 
numerical end at least early in the session, tended to rely on ordinal or rough 
perceptual judgments. The hypothesis that errors on simple balance and imbalance 
problems were in part caused by the failure to encode distance numerically is 
supported by the the fact that over all £2 subjects, only four errors on these 
types of problems '^are committed after subjects had givtn some indication that 
tney were encodii g distance numerically. Other suggestivt? evidence is provided 
by the ^'act thct of the 11 errors maoe on simple imbalance trials after first 
verbalization of tne ratio rule, six were committed on the configuration 
C51(?^/0£2»k:., while the others were committed on £ia«41/ia03ta, C003/;)10ei, I0en3/30end, 
v:mZ/1<c^Zy and ciZiai^/AiZ-'^i?. With the enceotion of the last configuration, each of 
the others could De made to balance by moving one of the stacks by no more than 
omv-half unit. It is difficult to rule out tne possuility, therefore, that 
cvuoe perceptual judgments of distance were a contributing factor to this type of 
error. 
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In suromary, more than two-thirds of the subjects explicitly stated a 
quantitative ruJe involvinq both weignt and distance before learning the . 
product-rnowent rule. There is some evidence that subjects who verbalized a form 
of the ratio rule first tended to learn the rule for one ratio (usually £:1) and 
then generalized it to others. Errors on simple ratio problems encountered after 
first verbalization of the ratio rule occurred possibly because: (1) the rule had 
not yet become generalized, (£) subjects made crude judgments about distance, and 
<3) subjects also used other heuristics such as instance-based reasoning (to be 
discussed later). 

t need y:»g^^CiLjd isia 

Tner'- ..as a substantial dur.ount of evidence suggesting tiiat subjects did not 
encode distance numericaHy tarly in the session. This evidence consisted of 
several ty.»es: U) positive indication of beginning to use the scale to encode 
distance numericallv, suggestir:: that the scale had not previously been used in 
this fashion and (2) behavior or verbal statements indicating thai subjects were 
encoding distance on the basis of crude perceptual estimates. In addition, some- 
commerits suggested that subjects encoded distance on an ordinal scale. 

Scal e f.se . Seventeen of the £2 subjects spontaneously gave evidence of 
counting or pointing to the unit marks on the scale in order to encode distance 
numerically. The mean trial on which they did so was £8.3 (sd«lia. 6, range of 
trial 14 to The remaining five subjects did not give clear evidence of 

using the seals until they received a fairly exDlicit prompt from the interviewer 
c\fter trial 48 (see method section). 

Thus most suDjects did not overtly use tne scale until after trial £13. This 
IS to be contraste^d with simcly mentioning the use of distance either while 
thintting aloud or m eKolaining wi^y a particular decision had been made. On the 
average, distance wac first mentioned on trial £ (sd=6. £, ranne of trial 1 to 
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trial 24). Howeveri the distribution was quite skewed. Thirteen of the 22 
subjects mentioned distance within the first three trials and only two took more 
than 15 trials to do so. The larcie average lag of tZ trials between comment inq 
on distance and overtly count inq distance units strongly suggests tnat subjects 
did not initially encooe distance numerically. 

Not all of the 17 subjects who spontaneously used the scale to measu'lr^ 
distariw^e seemeo, at least initially, to realize the importance of doing so. 
Altnoupn H3 subjects continued to use the scale consistently after the first 
trial on which they gave eviaence of counting, the remaining seven did not, 
requiring an average of an additional 13.4 trials <sd=6. C) before consistently 
using the scale^ to. encode distance. 

Of course, using ovei c counting of scale units provides an upper bound for 
the trial on which the subject begins to encode distance numerically, since it is 
possible to encox^e. distance numerically without providing any evidence of doing 
so. ^accordingly, enplicit signs that subjects were not using the scale were 
sought. 

B vidence o f not using tne scaje. Nine cf tne £2 subjects pave positive 
evidence that they were not using the scale at some point between the first trial 
on which they mentioned Distance ahd tne first trial on which they overtly gave 
an indication of cA)inting. Three of these subjects made comments that indicated 
they had just started to consider using the lines on the scale: one asked whether 
the lines -?re Important (trial 13), while the other two asked whether the 
diHtances )etween the marks were equal (trials 33 and 56). Three different 
subjects indicated that they were judging distance in a crude perceptual r«anner, 
one commenting tnat she needed a different perspective and moving her head (trial 
£8), and two others physically moving their chairs back and changing their lines 
of view (tv^ials 16 and 3c:>. The remaining three subjects commented on the 
difficulty of jucging distance, indicating that they did not realize that the 
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scale was present to aid in those judgments (trials 12, £5,34). 

The fact that more tnan one-third of the subjects gave explicit indications 
that they were not using the scale to measure distance, in combination with- the 
relatively large number of trials it took .to provide positive evidence of using 
the scale, strongly suggests that subjects did not use the scale early in the 
t-BBsion. The fact that subjects mentioned distance early indicates that distance 
was encoded, but specifically how this was done is open to speculation. 

BUeCKiatiyeJ^yP^LthBses. If distance was not encooed numerically using tne 
scale provided, there remain several possibilities, about how it might have been 
encoded: (1) suDjects may have tried to estimate distance quantitatively on the 
basis of perceptual judgments, or (£) subjects may have simply encoded distance 
on an ordinal scale, oroering distances rather than attaching numerical values to 
thern. 

Several subjects cave clear evidence of trying to estimate distance 
precisely without using the scale, e.g., (in response to lZi01Zi4/00iZH ) , "It just 
looks like this [referring to the weight on the right] is four times the distance 
of these." However, there were a large number of comments that suggested 
subjects were indeed encoding distance on an ordinal scale, tighteen. of the 
subjects maoe comments of this-,type at some point during the interview prior to 
pivmg explicit evidence of counting. Typical examples of such comments were (in 
response to zm^^'<L^^^)^ ""It might balance because they're in pretty far Cox) the 
riciht] to have too much of an effect to go down," or "This one is farther over 
and It's less weight and this one is closer and it's more weight." Statements of 
tnis tyoB referred only, to order relations, contained no reference to numbers of 

t 
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distance units, and were clearly distinguishable from statements that were later 
r.iaue after count incj had dequn, such as (in response to \m\^m^^)y "There's 
1,2,3,4,5,6 CsuDject counts half-units3, it's on the sixth line. Six and six is 
twt»lve" tsubjBct ados tne distance for each of the weights on tne rigntJ.- 
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Comments abour, oraer relatxons were noticeably absent from these later 
statements. 

Pll comments suggesting tne use of ordinal logic were tabulated for each 
subject. This coaing included all trials in which a statement of an ordinal 
relation such as "farther out," closer in," or "closer to the end" was made. Two 
types of cases were then excluded: <1) cases in. which numerical encoding of 
Distance wcls unnecessary J^,U<>whBn weignt was tne same on either side of the 
fulcrum or when the grei(t^Br weight was on the same siae of the fulcrum as the 
Dreater distance) ano t£) cases. in which a soecific reference was made to- an 
earlier problem. There were, on the average, 5.5 trials per subject in which 
Btatements sugqestmo tne use of ordinal loqic was made, after the two types of 
trials mentioned above'^were excluded. Eliminating the data q-f one subject who 
roaoe orcinal comments on 2ti trials gives a mean of 4.4 such t>-ialsr^r slibject 
(sd=3.5). Tnese aata sugpest'that early in the session, subjects commonly dealt 
with dist^ance in an ordinal fashion. 

3>ie use of inform ation ab out previ gMS_Bn.9 Pi gffis 

There were an average of 7,4 trials per subject (sd=3.6, range £ to 16) that 
"^could be clearly documented on which subjects maoe their predictions on the basis 
of a comparison with a previous problem or class of problems. Subjects made 
comments that chey were using specific information about previous problems on 
more than 15% of trials, despite the fact that they were c;;:ilicit ly told tnat 
their goal was to l|arn a general rule. 

•|hreejt_y.pes.._ There were three basic types of instance-based reasoning 
observea: (1) reasoning applied to a problem that differed from a previous 
problem by a single transformation, such as the addition of a block or the 
movement of a stack one unit of distance; Vd) reasoning about, a problem that 
ciffered from a previous problem bv more than one transformation; and (5) 
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reasoning in which the current problem Mas compared to a ».nown ratio 

configuration. Valid ana invalid me^nods of reasoning within each of the tnree 

basic .type^s will be discussed below. 

,^An^l^Jti:«!?MP»:yiti.Sr!i. subjects verbalized tne use of instance-based 

reasoning on an average of 3.7 problems in which the current problem was compared 

to a simple (one fetack of blocks on each side of the fulcrum) problem that 

differed frorn it on the basis of a single transformation. .There were three 

suDtyOBS of reasoning which were sensible heuristics in that they, produced 

correct answers some off the time. However, only one was consistelntly valid. The 

> 

valid form of reasoning (verbalized qn ari^,-avBragB of 1.1 trials per subject) was 
that if a single transformation was made to a configuration that balanced, the 
r.ew configuration should no longer ballnce. For examole, if ^^\^^<L^m is^SBBn to 
balancB, ^thBn ^X^^rdm^ must not balance since only a single transformation h6s 
been made. One form of invalid rBasonino (verDalizBd or, an average of l.C trials 
per subject) was that if the prBSBnt problem was similar enough tp a previous One 
(differing by only one transformation), the outcome snould be the same. The 
second type of invalid reasoning (1.5 trials per subject) was that if a single 
compensatory transformation is made to a configuration that did not balance, the 
new configuration should balance. Although this argument leads to correct 
predictions in some cases, it certainly coes not do so in general. 

Mullilllejtransfor^a^^^ There were an average of £.3 problems per subject 
"in wmch the prediction was made by comparing the current problem with a previous 
simple problem that differed from it by more than one transformation. The valid 
torm«i of reasoning (average of 0.7 trials per subject) included: (V) if two 
stacks are equally distant from the fulcrum, equal numbers of weights added to 
€?ach stack should not chance tne outcome (e.g., mz^l^tzm snould tip to the left 
given that Ctiiia/idlt?^ did so) and (2). given two stacks of weiVQhts at different 
Distances from the fulcrum, an inverse proportional change in the number of 
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.weights on either sloe will leave the outcome unchanged (i.e., application of the 
raticTrule as a transformation). The invalid forms (1.5 trials per subject) 
includeo: (1) equs^l numbers of weights added to (or subtracted from) stacks at 
different*' distances from the fulcrum leave the outcome unchanged (e. if 
(21080/401212 balances than so should lZi01l2)/2iZi0tZi) and (8) the addition (or 
subtraction) of a block to (or from) a stack in conjunction with the movement of 
the stack on the opposite side out (or in) one unit of distance will leave the 
outcoi(»e unchanged (e.g., if diisei/Aiaiifiei balances than so should ^800/5000) . 

Cofyiparison s with kn ow n ratios. In addition there were several types of 
reasoning in which the current complex problem (a problem with more than one 
stack oh at least one sioe of the fulcrum) was compared with a known problem that 
was known to balance because of the ratio rule. Valid forms of such reasoning 
(0.7 trials per subject) included: (1) stating that the current problem should 
not balance because it tiiffereo in a critical way from one that would be 
predicted to balance using the ratio rule (e.g., 1100/4000 should not balance 

r 

because tne ratio rule would predict that 1000/4000 snould balance) ana (£) 
stating that the current problem should balance because it is the sum of two 
individual ratios each of which balance (e. g^, 10S0/X200 should balance because 
1000/0200 and 0020/40ii;'0 both balance). In addition, if the current configuration 
was^iiiular enough" to a ratio configuration for which the outcome was knownv 
the outcome was sometimes (0.7 trial per subject) judged to be the same. 




The major goal of the present study was to characterize somfi of the changes 
that occur when subjects were presented with a series of balance problems and 
given the task of inducing a rule that would allow them to predict the outcomes. 
In particular, we were concerned with whether the development of learning could 
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be adequately characterized by Siegler's (1976) rule-based hierarchy of models or 
whether heuristics that do not fit neatly into this hierarchy, like differential 
encoding of distance, use of the ratio rule, and reasoning on the basis of 
previous instances of problems must also be considered. Because our data 
stront ly suggest that such heuristics are indeed used by subjects, Siegler's 
(.1976) hierarchy does not adequately describe the variety and complexity of 
reasoning processes that subjects engage in while attempting to induce a general 
rule. 

ft_we 1 1 _d e f 1 n ed s eo uence 7 It is unlikely that. any simple stage analysis can 
characterize the changes in knowledge states in more than a superficial manner. 
Our analysis of the protocols does not depict the subject as relentlessly 
progressing through a well defined sequence of levels until the product -moment 
rule IS reached. Distance may be encoded quantitatively on one trial but not on 
the next. The ratio rule may be applied t.o make a correct prediction for a 
problem and not be applied when a similar problem is presented a few trials 
later. The strategy employed depends on the particular problem and on problems 

that were encountered earlier. 

Mod el -determined encoding o f di stance. The protocols suggest that initially, 
subjects frequently rely on relatively primitive encoding of distance, ftfter 
they have made predictions about a number of simple balance problems, they tend 
to progress to the use of some mixture of the ratio rule (where appropriate) and 
instance-based reasoning and begin to consider more complex hypotheses. It seems 
likely that the level of encoding employed by subjects is largely determined by 
the heuristics or models they are using. The argument for this is twofold. 
First, our subjectc were of college age and it is extremely likely that they 
thought of cii stance as a quantitative concept in general. Thus, any failure to 
encode distance in a quantitative fashion more plausibly represents a failure to 
apply an existing concept of quantitative distance rather than the lack of the 
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concept. Second, there were occasions on which subjects used qualitative 
encoding after having earlier given some indication of encoding distance 
quantitatively, such as counting scale units or using the ratio rule. It seems 
plausible that some occurrences of qualitative encoding resulted from the use of 
more qualitative models, such as those involving instance-based reasoning. The 
pattern of reasoning seems similar to that engaged in by a mathematician or 
logician when he or she attempts to understand a problem on an intuitive, 
informal level before trying to formalize a hypothesis. 

The present hypothesis that the encoding employed by subjects will depend on 
the heuy^istics or models they use, contrasts with conclusions that might be drawn 
from Siegler's (1976) work on encoding with five- and eight-year-'olds. Siegler 
demonstrated that training children to encode both weight and distance 
significantly improved their performance on balance beam problems. Although 
quantitative encoding is necessary for an adequate understanding of the balance 
beam, it is not sufficient for understanding. While it is possible that learning 
how to encode distance quantitatively can be considered responsible for young 
children's progress in understanding the balance beam, this explanation has less 
appeal, when applied to older subjects who already know how to encode distance 
quantitatively. 

Insta nc e-based reasonino . The use of reasoning about problems by comparing 
them to previously experienced problems seems to be outside the scope of a 
rule-based model of learning. In fact, one might think that when subjects were 
instructed to gev^.csrate a general rulev the demand characteristics of the task 
would preclude reasoning based on individual instances. However, the prevalence 
of such reasoning is less surprising within the context of a growing literature 
concerned with the learning of complex concepts. Studies have demonstrated that 
learning to classify instances of a concept by a complex rule or set of rules, 
for example the pronunciation of Enqlish words (Brooks, 1978) or the learning of 
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syntactic structures <Reber, 1976), is most parsimoniously accounted for by a 
theory that claims subject categorize new instances by comparing thern to specific 
old instances. The product -moment rule may be sufficiently complex that basing a 
prediction on previous instances involves less strain on cognitive resources than 
immediately trying to infer z quantitative law. 

If one assumes that instance-based reasoning was prevalent, it is possible 
to develop an explanation for why subjects were not more consistent in their use 
of the ratio rule once they had first verbal izea a form of the rule. On 'a given 
problem, subjects may have used whatever information was available and seemed 
appropriate for that problem. In addition, there was some evidence tnat subjects 
first learned the Y^atio rule for a specific ratio and later generalized it. When 
subjects first verbalized a form of the ratio rule, they rarely made comments 
suggesting that the rule should hold for all ratios and in some cases expressed 
doubts that the relationship observed in a few instances was general izabie. For 
example, one subject said in considering whether the ratio rule verbalized 
earlier for a smaller ratio would hold for 4:1, "I know we kept the proportion 
the same, but I thought that there was a point at which you went one too many 
down here [referring to distance: and tne ratio didn't stay the same just because 
it was so far out on the end. You want to put one more block on there and move 
that one more? [requesting that the interviewer modify the problem from 

to Hii2i2iei/bei0ia3 Oh, you mean it's a constant rule, it doesn't change? 
Cafter observing tnat the beam still balanced]. 

|jhy_dp_subj9Ct s proore ss? The documentation and description of a change in 
understanding cannot in itself provide an explanation of the mechanism producing 
the change, fit most, the protocol provides a record of the stopping places on 
the journey. Nonetheless, we believe that documenting the changes in 
understanding demonstr-ted by subjects, and inferring the kinds of heuristics 
thev employ, represents an advance in our understanding of how subjects learn to 
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reason correctly about balance problems. At the very least, it is clear that our 
training procedure was sufficient, in that all of our. subjects were able to 
generate the product -moment rule within a reasonable amount of time. 

This findinq contrasts with the results Siegler :i97fc) obtained with younger 
subjects that indicated relatively few subjects der .ved the product -moment rule. 
While it IS possible that the difference in results was due to the aqe of tne 
subjects, there are several procedural differences between these training studies 
that seem relevant. First subjects in the present study took 43.1? trials to 
reach criterion, whereas subjects in Siegler' s observation condition received 
only 36 trials. Thus Siegler's observation condition may not have provided 
enough trials to be effective. The second, and .viore interesting possibility is 
that a critical difference may have been that in the present study subjects were 
asked to generate predictions for each prDbleni and to justify these predictions 
if possible, While Siegler' s subjects merely observe*] the problems and the 
outcomes. More active involvement with the problems may lead to more active 
hypothesis formation and hence faster learning. This interpretation is supported 
by recent research by Lewis and ftnderson (1985)^ who found in a study of the 
acquisition of problem solving operators that subjects learned correlations 
between problem features and operators only when they were forced to make and 
test exDlicit hypotheses. Clearly, active learning should be an important topic 
of research in the the future. Siegler and Klahr (19B£) also conducted another 
training study in which the majority of college-age subjects were able to learn 
to provide predictions consistent with Model IV after being presented with a 
sequence of balance-scale feeddack pv'obiems, provided they received either 
external memory aids (a .sheet of paper with schematic representations of each 
problem and its outcome). Quantified encoding (a procedure in which the 
mathematical nature of the task ;*as highlighted), or both. Siegler (personal - 
communication) has suggested that the effectiveness of the external memory aids 
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Manipulation may be eKplained in terms of making specific instances more 
available. 

Educa ti onal value of t h ejp oncept o f balancin g. The topic of how people come 
to understanJ the balance lieam gains significance when one considers that the 
balancing schema might influence the acquisition of mathematical concepts. One 
recent study has indicated that knowledge about the balance beam may facilitate 
understanding of the superficially unrelated concept of the mean (Hardiman, Well, 
and Pollatsek, 1984). Nearly all college students know the standard algorithm 
for computing the mean of a set of scores (i.e., add tne scores and divide by the 
number of scores) and can apply it correctly when the scores provided are based 
on. equal numbers of observations. However, relatively few adults can 
successfully compute the overall mean when given two subgroup means based on 
different numbers of scores. They do not "weight" the means in proportion to the 
number of scores in each group, but rather treat the group means as though they 
were based on equal numbers of scores (Pollatsek, Lima, & Well, 1961). 

Pollatsek et al (1981) suggested that subjects who failed to solve weighted 
mean problems Jacked "analog" knowledge of the mean. Such analog knowledge might 
involve a visual or kinesthetic image of the mean as a balance point, a commonly 
used metaphor ar> textbooks. Given two subgroup means based on different numbers 
of observations, the weighted mean or "balance point" must, lie closer to the 
subgroup mean based on the larger number of observations because that mean 
"weighs'* more, Hardiman et al (1984) assessed college students' knowledge of the 
concept of balancing and of the mean. Subjects who were consistently able to 
predict the outcomes of balance problems correctly were also able to solve 
weighted mean problems correctly and could represent weighted mean problems on 
the balance beam. More importantly, subjects who intially performed poorly* on 
both balance beam and weighted mean problems and who were given training on 
balance problems (but not on problems specifically having to do with the mean) 
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subsequently were able to solve weighted mean problems better and with greater 
understanding than control subjects who had not received this training. Thus the 
understanding of an important concept such as the mean can be enhanced through 
training with balance problems. An interesting question is whether acquisition 
of the product -moment rule is necessary in order for transfer to take place. 

Conclusions. Our study of how subjects attempt to make correct predictions 
about balance problems leads us to form several conclusions. Reasoning before 
the product -moment rule has been learned is complex, involving heuristics such as 
ordinal encoding of distance, reasoning from previous problems, and using 
quantitative rules of limited generality, as well as misconceptions about the 
physical nature of the balance beam itself. Not all heuristics seem to be 
employed by all subjects, indicating that there may be different paths toward 
acquiring the product -moment rule. Thus, it is not likely that a simple stage 
analysis or hierarchy of rules can adequately reflect the dynamics and complexity 
of intermediate stages of learning. The use of both limited rules like the ratio 
rule and instance-based reasoning does not fit into the models of SieglBr»s 
<1976) rule-based hierarchy nor does it conform to methods of discovery employed 
by some artificial intelligence programs; e.g., a program receives data in a 
tabular form and systematically reduces it to a single empirical law (Langley, 
1981). 

Brooks' (1978) work suggests tnat in relatively complex situations, 
categorisation based on comparisons with previously experienced exemplars may 
represent less of a load on cognitive resources than an attempt to abstract a 
rule. The central role of critical examples has been acknowledged within the 
domain of mathematics (Michener, 1978) and in the context of learning to classify 
instances generated by a complex rule or set of rules <Brobks, 1978; Reber, 
1976), but not within the context of inducing a relatively simple pnysical law. 
The documentation of the use of instance-based reasoning in the present study 
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suggests that reasonirig from salient examples may be a rather general heuristic 
employed in many types of reasoninq, inducing judgments aDout frequency and 
probability (e.g. Kahnernan, Slovic, and Tversky, 1982), understanding 
mathematics (Michener, 1978) and physics (Clement, 1981), and deducing syntactic 
structure (Reber, 1976), as well as in the discovery of physical laws. 
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Table 1 



Siegler*s 
classification'' 



Inhelder and Piaget*s 
classification 




Stage IP: 



Stage IB: 




Ho<«l 0 



H««ti in 



Stage I IPs 




Stage XIBs 



Subjects fail to distinquish 
their own actjions from external 
processes <e. g. , the subject " 
will push the beam so that it Is 
level and expect it to remain that 
way). 

Subjects realize that weight is 
needed on both sides of the fulcrum 
to achieve balance but there is as 
yet no systematic correspondence 
batween weight and distance. 

Subjects achieve balance by making 
weight and distance both symmet- 
rical. Subjects discover by trial- 
and-error that there is equilibrium 
between a smaller weight at a large 
distance from the fulcrum and a 
greater weight at a small distance 
but do not draw out general conse- 
quences. 

Subjects develop qualitative under- 
standing of the relationship between 
weight and distance. 



H.4*l IS 




Stage IllPs Subjects start to discover the 
quantitative law for balancing^. 
It takes the form of the proposition 
M/U* -L* /L where W and W are two 
unequal weights and L and L* are the 
distances from the fulcru<n at which 
they are placed. 

Stage XI IB: Subjects search for a causal 
explanation. 



« after Klahr and Siegler (1978) 
b "muddle through" means guess 

^ at least for the special case in which weights are placed 
at only one distance on each side of the fulcrum 



ERIC 



38 



